Handling Structural Ambiguity in a Knowledge-Based Information Retrieval System
نویسندگان
چکیده
This paper presents a strategy to handle syntactic ambiguity in a theoretically motivated fashion following general linguistic principles. This strategy, which is called underspecification, was implemented in a Natural Language Engine (NLE) for automatic information extraction, called ELSA (an acronym for English Language Semantic Analyser), which was developed at the Department of Language & Speech of the University of Nijmegen. The crucial idea of the strategy is that, in case of ambiguity, the NLE should know what option to choose and when to choose it. Until that moment the analysis remains underspecified, i.e. only one derivation is produced. At present time, the NLE in question is adapted to serve as the linguistic module of a knowledge-based Information Retrieval System, called Condorcet, being developed at theUniversity of Twente, for documents on the fields ofmechanical properties of engineering ceramics as a subfield of engineering, and epilepsy as a subfield of medicine. In this paper we will show how a theory-driven NLE will make a substantial contribution to (semi)automatic information retrieval, making use of the AGFL system. The authors are greatly indebted to Nicolaas J.I. Mars and Paul E. van der Vet, the initiators of the Condorcet project, for their substantial contribution to this article. Chapter
منابع مشابه
Performance Evaluation of Medical Image Retrieval Systems Based on a Systematic Review of the Current Literature
Background and Aim: Image, as a kind of information vehicle which can convey a large volume of information, is important especially in medicine field. Existence of different attributes of image features and various search algorithms in medical image retrieval systems and lack of an authority to evaluate the quality of retrieval systems, make a systematic review in medical image retrieval system...
متن کاملBehavioral Considerations in Developing Web Information Systems: User-centered Design Agenda
The current paper explores designing a web information retrieval system regarding the searching behavior of users in real and everyday life. Designing an information system that is closely linked to human behavior is equally important for providers and the end users. From an Information Science point of view, four approaches in designing information retrieval systems were identified as system-...
متن کاملKnowledge Sources for Textual CBR Applications
Textual CBR applications address issues that have traditionally been dealt with in the Information Retrieval community, namely the handling of textual documents. As CBR is a knowledge-based technique, the question arises where items of knowledge may come from and how they might contribute to the implementation of a Textual CBR system. In this paper, we will show how various pieces of knowledge ...
متن کاملImproved Skips for Faster Postings List Intersection
Information retrieval can be achieved through computerized processes by generating a list of relevant responses to a query. The document processor, matching function and query analyzer are the main components of an information retrieval system. Document retrieval system is fundamentally based on: Boolean, vector-space, probabilistic, and language models. In this paper, a new methodology for mat...
متن کاملImproved Skips for Faster Postings List Intersection
Information retrieval can be achieved through computerized processes by generating a list of relevant responses to a query. The document processor, matching function and query analyzer are the main components of an information retrieval system. Document retrieval system is fundamentally based on: Boolean, vector-space, probabilistic, and language models. In this paper, a new methodology for mat...
متن کامل